Polynucleotides encoding CMV neutralizing proteins

ABSTRACT

The present invention provides recombinant polypeptides derived from CMV glycoprotein gB and truncated fragments thereof which contain at least one epitope which is immunologically identifiable with one encoded by the CMV genome. The complete characterization of the gB protein, including the identity of glycoprotein gp55, permits the production of polypeptides which are useful as standards or reagents in diagnostic tests and/or as components of vaccines. This invention provides recombinant polypeptides and recombinant polynucleotides encoding these polypeptides wherein a neutralizing epitope of gB is localized within gp55.

This application is a divisional of U.S. application Ser. No. 08/341,293, filed 15 Nov. 1994, now U.S. Pat. No. 5,547,834, which is a continuation of U.S. application Ser. No. 07/671,690, filed 26 Mar. 1991, now abandoned, which claims priority from International Application No. PCT/US89/00323 filed 26 Jan. 1989 and designated the United States, which is a continuation-in-part of U.S. application Ser. No. 07/149,715 filed 29 Jan. 1988, now abandoned.

TECHNICAL FIELD

The invention relates to recombinant human cytomegalovirus (CMV) proteins, and is directed to the production of neutralizing forms of gB protein and truncated forms thereof, their vaccine potential, and diagnostic DNA fragments thereof.

BACKGROUND OF THE INVENTION

Human cytomegalovirus (CMV) is a ubiquitous agent in human populations. Infections are generally asymptomatic, but there can be serious medical manifestations of the disease in immunocompromised individuals (transplant recipients and AIDS patients) and in congenitally infected newborns. In immunodeficient patients, primary CMV infection and reactivation of latent virus is associated with serious disease including retinitis and pneumonia. CMV infection also predisposes the patient to fungal and bacterial infections. Congenital CMV infection of the fetus occurs in about 1% (36,000) of infants born in the U.S. per year. Of these infants 10-20% will have symptomatic infection at birth or within two years of birth with a mortality rate of 10-15%. Among the survivors, many will have mild to severe neurologic complications including hearing loss, learning disabilities and mental retardation.

Vaccines that prevent or reduce CMV-associated disease are clearly needed. The CMV (Towne) strain has been tested as a vaccine candidate in normal individuals and renal transplant patients (Quinnan, Jr., G. V. et al. (1984) Am Intern Med 101:478-483); (Plotkin, S. A. 1985, CMV Vaccines, In: The Herpes Viruses vol. 4, ed., Roizman and Lopez, Plenum Press, N.Y., p. 297-312). While this vaccine appeared to have no deleterious effects and did reduce symptoms of CMV disease in transplant recipients, there are many objections to the use of experimental live attenuated virus vaccines, including the possibility of immune impairment resulting from virus infection and reports of possible association between CMV and oncogenesis.

In the absence of a complete understanding of the biology of CMV, the most rational approach to a vaccine would involve the development of subunit vaccines based upon the surface glycoproteins of the virus using recombinant viral glycoproteins which elicit neutralizing antibodies.

Like other herpesviruses, CMV specifies multiple glycoproteins (Stinski, M. (1976) J Virol 19:594-609; Pereira, L., et al. (1982) Infect Immun 36:933-942). Characterization of these have involved studies of CMV-infected cells and purified virions using polyclonal and monoclonal antibodies (Pereira, L., et al. (1984) Virology 139:73-86; Britt, W. J. (1984) Virology 135:369-378; Nowak, B., et al. (1984) Virology 132:325-338; Law, K. M., et al. (1985) J Med Virol 17:255-266; Rasmussen, L., et al. (1984) Proc Natl Acad Sci USA 81:876-880; and Britt and Auger (1986) J Virol 58:185-191).

U.S. Pat. No. 4,689,225, issued 25 Aug. 1987 and based upon the work described in the Pereira et al. references, supra, describes a method and vaccine for CMV infections using a polypeptide designated therein as glycoprotein A (gA1-A7) of cytomegalovirus. Two glycoproteins designated p130 (gp130) and p55 (gp55) (based on the molecular weights given in kilodaltons) have been partially purified and shown to elicit a neutralizing response in guinea pigs (Rasmussen, L., et al. (1985) J Virology 55:274-280). The gp130 glycoprotein appears to be a precursor to the gp55 glycoprotein.

The gB gene from CMV strain AD169 (which appears to be similar to the p130 CMV protein described by Rasmussen et al., supra) has been identified by nucleotide sequencing (Cranage, M. P. et al. (1986) EMBO J 5(11):3057-3063) with a 906 amino acid protein deduced therefrom. The gB gene product was expressed in recombinant vaccinia virus and rabbits immunized with this gene product produced antibodies that immunoprecipitate gB from CMV-infected cells and neutralize CMV infectivity in vitro (See also WO 87/05326).

Although there is much ongoing activity towards both the identification of major gylcoproteins which are the targets for viral neutralization and the development of a subunit CMV vaccine, to date, the origin of the gp55 CMV glycoprotein has not been established nor has gp55 been identified by nucleotide or amino acid sequence and therefore, no vaccine composed of the 55,000 dalton recombinant viral gB protein or any truncated recombinant polypeptide thereof has been reported. Clearly, in light of the absence of a complete understanding of the biology of CMV, it would be desirable to provide a safe, effective and economic vaccine capable of affording protection against cytomegalovirus infections, as well as to provide diagnostic reagents capable of detecting the particular immunogenic stimulus resulting from CMV infections.

DISCLOSURE OF THE INVENTION

The present invention provides recombinant polypeptides derived from the 55,000 dalton protein derived from gB and truncated fragments thereof which contain an epitope which is immunologically identifiable with one encoded by the CMV genome. A recombinant polypeptide derived from the gp55 CMV glycoprotein gB is provided in one embodiment of the invention.

The complete characterization of the gp55 protein derived from gB permits the production of polypeptides which are useful as standards or reagents in diagnostic tests and/or as components of vaccines. Since the desired polypeptide can be synthetically made in a relatively pure form or by recombinant DNA technology, the problems with other methods of immunogen and vaccine manufacture, including coproduction of competitive antigens and contaminants, are avoided.

In a preferred embodiment of the invention, the truncated gp55 gB fragment contains an epitope that is immunologically reactive with a CMV neutralizing antibody. The neutralizing antibody can be generated by techniques known in the art such as that described for monoclonal antibodies disclosed in Rasmussen et al., supra and U.S. Pat. No. 4,689,225.

Also provided in another preferred embodiment of the invention is a recombinant polypeptide encoded within CMV glycoprotein gB, which has a modified endoproteolytic cleavage site such that cleavage of the gB protein is effectively inhibited. The modification of the cleavage site is accomplished using site specific mutagenesis on the DNA encoding the polypeptide at or near the proteolytic cleavage site. Related to this aspect of the invention are the polynucleotides encoding the recombinant polypeptides.

Another aspect of the invention is a recombinant gp55 polynucleotide comprising a nucleotide sequence derived from the CMV gB gene. Related to this aspect of the invention are truncated recombinant polynucleotides containing regions encompassing nucleotides 1381 through 2040 of gp55 and nucleotides 1381 through 1938 of gp55, which regions contain an epitope which is immunologically reactive with a CMV neutralizing antibody.

Yet another aspect of the invention provides an expression system comprising host cells transformed with a vector containing the recombinant polynucleotides of the invention.

Another aspect of the invention provides a vaccine or prophylactic agent against human cytomegalovirus infection, said vaccine comprising a recombinant gp55 polypeptide derived from the CMV gB genome or the endoproteolytic cleavage site modified gB polypeptide in amounts effective to elicit viral neutralizing activity against cytomegalovirus when administered to a susceptible individual.

Still another aspect of the invention provides a vaccine against human cytomegalovirus infection, said vaccine comprising a recombinant polypeptide derived from a truncated fragment encoded within CMV glycoprotein gB wherein the truncated fragment contains an epitope which is immunologically reactive with a CMV neutralizing antibody, said recombinant polypeptide being present in an immunologically acceptable carrier in an amount effective to elicit viral neutralizing activity against cytomegalovirus when administered to a susceptible individual.

Another aspect of the invention provides for a DNA hybridization assay for detecting CMV homologous sequences in a biological sample comprising: a) incubating a biological sample with a DNA probe, which probe may be optionally labeled with an enzyme, radioactive tag or a fluorescent tag, under conditions which promote the formation of DNA duplexes, wherein said DNA probe is derived from gp55 nucleotide sequences; and b) detecting the formed DNA duplexes containing the DNA probe.

A further aspect of the invention provides an immunoassay for detecting antibodies directed against a CMV antigen in a biological specimen comprising:

(a) incubating a biological sample with a probe polypeptide under conditions which allow the formation of an antibody-antigen complex, wherein said probe polypeptide consists of the p55 CMV recombinant protein or a truncated fragment thereof and said protein or truncated fragment contains an epitope which is immunologically reactive with a CMV-neutralizing antibody; and

(b) detecting an antibody-antigen complex containing the probe antigen.

Yet another aspect of the invention provides polyclonal antibodies against the recombinant gp55 or truncated polypeptides thereof, for immune prophylaxis.

Other and further aspects of the present invention will be apparent from the following description and claims and other embodiments of the invention employing the same or equivalent principles may be used by those skilled in the art without departing from the present invention and the purview of the appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a HindIII restriction map of the CMV (Towne) genome displayed in the parental orientation. Unique sequences are denoted by a thin line, and inverted repeats of the Long (L), and Short (S) components are denoted by boxes, ab-b'a', and a'c'-ca. The a sequence, distinguished as a white box, is a terminal direct repeat with an inverted copy (a') at the L/S junction.

The lower restriction map illustrates the ˜4.96 kb BamHI E/R to HindIII D/A fragment encoding the gB gene. The Towne DNA fragment cloned into pXgB1 is shown above the line and the largely colinear AD169 fragment is shown below the line. Restriction enzyme abbreviations are B, BamHI; Bg, BglII; C, ClaI; E, EcoRI; Eg, EagI; H, HindIII; Hc, HincII; K, KpnI; P, PstI; S, SacII; Sa, SalI; X, XhoI.

FIGS. 2A-2F illustrate the nucleotide and deduced amino acid sequences for the gB envelope protein of CMV strains Towne and AD169. The gp55 cleavage site between amino acids 460 and 461 is indicated by the arrow. The N-terminal sequence analysis of gp55, which revealed this cleavage site, is shown in Table 2.

FIGS. 3A-3D are is an illustration of the mammalian cell expression vectors of the invention. Plasmids pXgB7 (4.5 kb) and pXgB8 (6.5 kb) encode a truncated version of gB cloned as a partial SacII/XhoI fragment into pSV7d, an SV40 based expression vector or pON260, a CMV-based expression vector, respectively. Plasmids pXgB12 (6.4 kb) and pXgB13 (5.5 kb) encode a full length gB gene cloned as an EagI fragment into plasmid pMIE, a CMV-based expression vector and pSV7d, respectively. Transcriptional initiation and termination elements differ among each construction.

FIG. 4 is a schematic representation of a topographical map of epitopes on CMV gB. Discontinuous neutralizing domains (domain 1=amino acids 461-619; domain 2a and 2b=amino acids 620-680) are labeled by ellipses.

MODES FOR CARRYING OUT THE INVENTION

I. Definitions

As used herein, a polynucleotide "derived from" a designated sequence, for example, the DNA from the CMV gB gene, refers to a polynucleotide sequence which is comprised of a sequence of at least 6-20 nucleotides, more preferably at least 15 to 20 nucleotides corresponding, i.e., identical to or complementary to, a region of the designated nucleotide sequence. The correspondence to the nucleic acid sequence will be approximately 70% or greater, will preferably be at least 80%, and even more preferably will be at least 90%.

The correspondence or non-correspondence of the derived sequence to other sequence can be determined by hybridization under the appropriate stringency conditions, using standard DNA hybridization technologies in liquid phases or on solid supports. Hybridization techniques for determining the complementarity of nucleic acid sequences are known in the art (see, for example, Maniatis et al. (1982)), and are discussed infra. In addition, mismatches of duplex polynucleotides formed by hybridization can be determined by known techniques, including digestion with a nuclease such as S1 that specifically digests single-stranded areas in duplex polynucleotides. Regions from which typical DNA sequences may be "derived" include but are not limited to, regions encoding specific epitopes.

The derived polynucleotide is not necessarily physically derived from the nucleotide sequence shown, but may be generated in any manner, including for example, chemical synthesis or DNA replication or reverse transcription, which methods are based on the information provided by the sequence of bases in the region(s) from which the polynucleotide is derived.

Similarly, a polypeptide "derived from" a designated sequence, for example, the truncated CMV gB glycoprotein, refers to a polypeptide having an amino acid sequence identical to that of a polypeptide encoded in the sequence, or a protein thereof wherein the portion consists of at least 5-10 amino acids, and more preferably at least 10-15 amino acids, or which is immunologically identifiable with a polypeptide encoded in the sequence.

The term "recombinant polynucleotide" as used herein to characterize a polynucleotide useful for the production of CMV diagnostics and/or subunit vaccines intends a polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all or a portion of the polynucleotide with which it is associated in nature or in the form of a library; and/or (2) is linked to a polynucleotide other than that to which it is linked in nature.

"Recombinant host cells", "host cells", "cells", "cell lines", "cell cultures", and other such terms denoting prokaryotic microorganisms or eukaryotic cell lines cultured as unicellular entities, are used interchangeably, and refer to cells which can be, or have been, used as recipients for recombinant vector or other transfer DNA, and include the progeny of the original cell which has been transfected. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement as the original parent, due to accidental or deliberate mutation. Progeny of the parental cell which are sufficiently similar to the parent to be characterized by the relevant property, such as the presence of a nucleotide sequence encoding a desired peptide, are included in the progeny intended by this definition, and are covered by the above terms.

A "replicon" is any genetic element, e.g., a plasmid, a chromosome, a virus, that behaves as an autonomous unit of polynucleotide replication within a cell; i.e., capable of replication under its own control.

A "vector" is a replicon in which another polynucleotide segment is attached, so as to bring about the replication and/or expression of the attached segment.

"Control sequence" refers to polynucleotide sequences which are necessary to effect the expression of coding sequences to which they are ligated. The nature of such control sequences differs depending upon the host organism; in prokaryotes, such control sequences generally include promoter, ribosomal binding site, and terminators; in eukaryotes, generally, such control sequences include promoters, terminators and, in some instances, enhancers. The term "control sequences" is intended to include, at a minimum, all components whose presence is necessary for expression, and may also include additional components whose presence is advantageous, for example, leader sequences.

"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner. A control sequence "operably linked" to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences.

"Immunologically identifiable with/as" refers to the presence of epitope(s) in the non-native, i.e., artificially synthesized or recombinant protein, which are also present in and are unique to the designated CMV polypeptide(s). Immunological identity may be determined by antibody binding and/or competition in binding; these techniques are known to those of average skill in the art, and are also illustrated infra. The uniqueness of an epitope can also be determined by computer searches of known data banks, e.g. Genbank, for the polynucleotide sequences which encode the epitope, and by amino acid sequence comparisons with other known proteins.

As used herein, "epitope" refers to an antigenic determinant of a polypeptide; an epitope could comprise 3 amino acids in a spatial conformation which is unique to the epitope, generally an epitope consists of at least 5 such amino acids, and more usually, consists of at least 8-10 such amino acids.

A polypeptide is "immunologically reactive" with an antibody when it binds to an antibody due to antibody recognition of a specific epitope contained within the polypeptide. Immunological reactivity may be determined by antibody binding, more particularly by the kinetics of antibody binding, and/or by competition in binding using as competitor(s) a known polypeptide(s) containing an epitope against which the antibody is directed. The techniques for determining whether a polypeptide is immunologically reactive with an antibody are known in the art.

The term "polypeptide" refers to the amino acid product of a sequence encoded within a genome, and does not refer to a specific length of the product, thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. This term also does not refer to post-expression modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations and the like.

"Transformation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, irrespective of the method used for the insertion, for example, direct uptake, transduction, or f-mating. The exogenous polynucleotide may be maintained as a non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the host genome.

"Treatment" as used herein refers to prophylaxis and/or therapy.

An "individual", as used herein, refers to vertebrates, particularly members of the mammalian species, and includes but is not limited to domestic animals, sports animals, primates, and humans.

The DNA encoding the desired polypeptide, whether in fused or mature form, and whether or not containing a signal sequence to permit secretion, may be ligated into expression vectors suitable for any convenient host. Both eukaryotic and prokaryotic host systems are presently used in forming recombinant polypeptides, and a summary of some of the more common control systems and host cell lines is given in Section III.A., infra. The polypeptide is then isolated from lysed cells or from the culture medium and purified to the extent needed for its intended use. Purification may be by techniques known in the art, for example, salt fractionation, chromatography on ion exchange resins, affinity chromatography, centrifugation, and the like. See, for example, Methods in Enzymology for a variety of methods for purifying proteins. Such polypeptides can be used as diagnostics, or those which give rise to neutralizing antibodies may be formulated into vaccines. Antibodies raised against these polypeptides can also be used as diagnostics, or for passive immunotherapy.

II. Description of the Invention

The glycoprotein which is the subject of the present invention, the 55,000 dalton glycoprotein encoded by the glycoprotein B gene, has been shown to induce neutralizing antibodies against CMV (Rasmussen, et al. 1985, supra). In particular, the polypeptides of the present invention correspond to proteins of the viral genome which are homologous to certain portions of the CMV gB envelope protein gp130 and the gp55 derived thereof.

Referring now to FIG. 2 showing the nucleotide and deduced amino acid sequences for the gB envelope protein of CMV strains Towne and AD169, the gp55 recombinant protein of the present invention is a 447 amino acid protein beginning at its amino terminus with serine at residue 461 (Ser₄₆₁) and terminating at valine residue 907 (Val₉₀₇). Truncated forms of this protein of particular interest contain substantial amino acid sequence homology to the region of gp55 which contains epitopes that are immunologically reactive with CMV neutralizing antibodies. Generally, this region of the CMV envelope protein lies within residue 461 to about residue 680. More particularly, discontinuous neutralizing domains have been localized: domain 1 spans amino acids 461-619 and domains 2a and 2b span amino acids 620-680.

In order to determine the region of gB containing these neutralizing epitopes, the nucleotide sequence of the Towne gB gene was first determined. The Towne strain was chosen because of its demonstrated safety as a vaccine. A restriction map of the analogous HindIII D fragment of CMV (Towne) was derived. A 4.96 kb HindIII to BamHI fragment from the right end of HindIII D, which was likely to encode gB (see FIG. 1), was subcloned. In FIG. 1, the restriction map for the Towne strain is compared to the same region of the AD169 strain. The nucleotide sequence of the gB region was determined from the 5' most distal PstI site to the HincII site 3' to the gB coding sequence (FIG. 1). In FIGS. 2A-2F, the gB (Towne) sequence is shown on the top line and, for comparison, the DNA sequence of the CMV (AD169) gB region is shown on the bottom line.

The Towne gB gene is encoded by an open reading frame of 2721 basepairs. Two other long open reading frames (ORF) are also present in the Towne sequence shown in FIGS. 2A-2F. The second ORF, which is out of frame with respect to the gB gene, extends from the HindIII D/A site (not shown) through the 5' untranslated region of the gB gene and terminates at nucleotide +36. The third ORF, also out of frame with respect to the gB gene, starts at nucleotide +2864 and extends through to the end of the sequence shown in FIGS. 2A-2F.

The size of the gB protein predicted from the 2721 bp ORF is 907 amino acids long and has features characteristic of a membrane protein. A potential 24 amino acid signal sequence is shown in FIGS. 2A-2F (Met₁ to Ser₂₄). The signal domain contains a hydrophobic core (Ile₅ to Val₂₃, with the exception of Asn₁₃) preceded by a charged residue (Arg₄).

Full length and truncated versions of the gB gene were cloned into plasmids suitable for expression in mammalian cells. The construction of these plasmids is described in detail in the examples which follow. Truncated forms of the gB gene were constructed by deleting amino acids 681 to 907 and 647 to 907 at the C-terminus, removing the transmembrane and C-terminal domains.

The expression of the gB gene encoded by the full length and truncated constructs was analyzed by transient expression in COS-7 cells using the virus neutralizing murine monoclonal antibody 15D8 (described by Rasmussen et al., 1985, supra) as a probe for expression. This antibody is directed against a 55 kd virion glycoprotein (gp55) and a related 130 kd (gp130) intracellular precursor. The antibody 15D8 can neutralize a wide range of clinical and laboratory strains in the presence of complement thereby establishing this gB epitope as an important target for virus neutralizing antibody.

The expression of truncated forms of the gB protein was also analyzed using panels of monoclonal antibodies described in U.S. Pat. No. 4,689,225. Of these antibodies, ten with complement-dependent and independent neutralizing activity reacted with a truncated derivative of gB (gBt) that contained 619 amino terminal residues but lacks the transmembrane and intracellular region of the molecule. Twelve antibodies reacted with a CHO cell line expressing a 680 amino C-terminal deleted gB derivative (CHO cell line 67).

In addition, a gB polypeptide having a mutagenized endoproteolytic cleavage site is provided herein. Results obtained using a calcium-specific ionophore A23187 to inhibit cleavage of the gB molecule expressed in a stable CHO cell line (67.77), indicate the feasibility of expressing a 110 kilodalton uncleaved gB protein, i.e., which lacks the transmembrane and putative cytoplasmic domains. The ability to express the gB molecule without subsequent processing permits the production of the desired protein free from other contaminating or undesirable gB products.

Mutagenesis oligonucleotides, designed to change the amino acid sequence at or near the proteolytic cleavage site in a conservative manner, are used to substitute, for example, threonine or glutamine residues for arginine or lysine, at positions -1, -2 and -4 relative to the point of cleavage after amino acid Arg₄₆₀.

These endoproteolytic cleavage site mutants, upon expression in mammalian cell expression vectors, are tested for resistance to proteolysis and radiolabeled cell lysates and conditioned medias of cells receiving these constructs are radioimmunoprecipitated with neutralizing monoclonal antibodies to analyze gB expression.

II.B. Preparation of Antigenic Polypeptides and Conjugation with Carrier

An antigenic region of a polypeptide is generally relatively small--typically 8 to 10 amino acids or less in length. Fragments of as few as 5 amino acids may characterize an antigenic region. DNAs encoding short segments of CMV gB polypeptides can be expressed recombinantly either as fusion proteins, or as isolated polypeptides. In addition, short amino acid sequences can be conveniently obtained by chemical synthesis. In instances wherein the synthesized polypeptide is correctly configured so as to provide the correct epitope, but is too small to be immunogenic, the polypeptide may be linked to a suitable carrier.

A number of techniques for obtaining such linkage are known in the art, including the formation of disulfide linkages using N-succinimidyl-3-(2-pyridyl-thio)propionate (SPDP) and succinimidyl 4-(N-maleimidomethyl)cyclohexane-1-carboxylate (SMCC) obtained from Pierce Company, Rockford, Ill. (If the peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue.) These reagents create a disulfide linkage between themselves and peptide cysteine residues on one protein and an amide linkage through the epsilon-amino on a lysine, or other free amino group in the other. A variety of such disulfide/amide-forming agents are known. See, for example, Immun Rev (1982) 62:185. Other bifunctional coupling agents form a thioether rather than a disulfide linkage. Many of these thio-ether-forming agents are commercially available and include reactive esters of 6-maleimidocaproic acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-(N-maleimido-methyl)cyclohexane-1-carboxylic acid, and the like. The carboxyl groups can be activated by combining them with succinimide or 1-hydroxyl-2-nitro-4-sulfonic acid, sodium salt. The foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly be used.

Any carrier may be used which does not itself induce the production of antibodies harmful to the host. Suitable carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, such as latex functionalized sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino acids, such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus particles. Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other proteins well known to those skilled in the art.

II.C. Preparation of Hybrid Particle Immunogens Containing CMV Epitopes

The immunogenicity of the epitopes of CMV may also be enhanced by preparing them in mammalian or yeast systems fused with particle-forming proteins such as that associated with hepatitis B surface antigen. Constructs wherein the CMV epitope is linked directly to the particle-forming protein coding sequences produce hybrids which are immunogenic with respect to the CMV epitope. In addition, all of the vectors prepared include epitopes specific to HBV, having various degrees of immunogenicity, such as, for example, the pre-S peptide. Thus, particles constructed from particle forming protein which include CMV sequences are immunogenic with respect to CMV and HBV.

Hepatitis surface antigen (HBSAg) has been shown to be formed and assembled into particles in S. cerevisiae (Valenzuela, et al. (1982) Nature 298:344), as well as in, for example, mammalian cells (Valenzuela, P., et al. (1984), in Hepatitis B (Millman, I., ed., Plenum Press) pp. 225-236). The formation of such particles has been shown to enhance the immunogenicity of the monomer subunit. The constructs may also include the immuno-dominant epitope of HBSAg, comprising the 55 amino acids of the presurface (pre-S) region. Neurath, et al. (1984). Constructs of the pre-S-HBSAg particle expressible in yeast are disclosed in European Patent Publication 174,444; hybrids including heterologous viral sequences for yeast expression are disclosed in European Patent Publication 175,261. Both applications are assigned to the herein assignee, and are incorporated herein by reference. These constructs may also be expressed in mammalian cells such as Chinese hamster ovary (CHO) cells using an SV40-dihydrofolate reductase vector (Michelle, et al. (1984) Int Symposium on Viral Hepatitis).

II.D. Preparation of Vaccines

Vaccines may be prepared from one or more immunogenic polypeptides encoded within the recombinant polynucleotide sequences of gB.

In addition, prophylactic agents comprising the 110 kilodalton uncleaved C-terminal truncated gB protein, are also useful to assess the effects of processing on CMV infectivity and pathogenicity. As demonstrated for several other viruses (hemagglutinin of influenza virus and HIV gp160), endoproteolytic cleavage of precursor polypeptides is an essential step in the maturation of viral peptides, that is, for viral replication and infectivity. The present endoproteolytic cleavage site mutants, in addition to eliminating the production of multiple processed forms of gB, are believed to permit the generation of a viral neutralizing response in a subject without concommitant risk of introducing an active infection.

The preparation of vaccines which contain immunogenic polypeptide(s) as active ingredients, is known to one skilled in the art. Typically, such vaccines are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The active immunogenic ingredients are often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), and RIBI, which contains three components extracted from bacteria, monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by measuring the amount of antibodies directed against an immunogenic polypeptide containing a CMV antigenic sequence resulting from administration of this polypeptide in vaccines which are also comprised of the various adjuvants.

The vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkaline glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1%-2%. Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 10%-95% of active ingredient, preferably 25%-70%.

The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable salts include the acid addition salts (formed with free amino groups of the peptide) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids such as acetic, oxalic, tartaric, maleic, and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.

II.E. Dosage and Administration of Vaccines

The vaccines are administered in a manner compatible with the dosage formulation, and in such amount as will be prophylactically and/or therapeutically effective. The quantity to be administered, which is generally in the range of 5 micrograms to 250 micrograms of antigen per dose, depends on the subject to be treated, capacity of the subject's immune system to synthesize antibodies, and the degree of protection desired. Precise amounts of active ingredient required to be administered may depend on the judgment of the practitioner and may be peculiar to each subject.

The vaccine may be given in a single dose schedule, or preferably in a multiple dose schedule. A multiple dose schedule is one in which a primary course of vaccination may be with 1-10 separate doses, followed by other doses given at subsequent time intervals required to maintain and or re-enforce the immune response, for example, at 1-4 months for a second dose, and if needed, a subsequent dose(s) after several months.

II.F. Preparation of Antibodies Against CMV Epitopes

The immunogenic polypeptides prepared as described above may be used to produce antibodies, both polyclonal and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, goat, guinea pig, horse, etc.) is immunized with an immunogenic polypeptide bearing a CMV epitope(s). Serum from the immunized animal is collected and treated according to known procedures. If serum containing polyclonal antibodies to a CMV epitope contains antibodies to other antigens, the polyclonal antibodies can be purified by immunoaffinity chromatography. Techniques for producing and processing polyclonal antisera are known in the art, see for example, Mayer and Walker, eds., (1987) Immunochemical Methods in Cell and Molecular Biology, Academic Press, London.

Monoclonal antibodies directed against CMV epitopes can also be readily produced by one skilled in the art. The general methodology for making monoclonal antibodies by hybridomas is well known. Immortal antibody-producing cell lines can be created by cell fusion, and also by other techniques such as direct transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g., Rasmussen et al. (1985) supra; M. Schreier, et al. (1980) Hybridoma Techniques; Hammerling, et al. (1981) Monoclonal Antibodies and T-Cell Hybridomas; Kennett, et al. (1980) Monoclonal Antibodies; see also, U.S. Pat. Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,452,570; 4,466,917; 4,472,500; 4,491,632; and 4,493,890. Panels of monoclonal antibodies produced against CMV epitopes can be screened for various properties; i.e., for isotype, epitope affinity, etc.

Antibodies, both monoclonal and polyclonal, which are directed against CMV epitopes are particularly useful in diagnosis, and those which are neutralizing are useful in passive immunotherapy. Monoclonal antibodies, in particular, may be used to raise anti-idiotype antibodies.

Anti-idiotype antibodies are immunoglobulins which carry an "internal image" of the antigen of the infectious agent against which protection is desired. Techniques for raising anti-idiotype antibodies are known in the art. See, for example, Grzych et al. (1985) Nature 316:74 and Macnamara et al. (1984) Science 226:1325. Generally, the truncated CMV recombinant peptides described herein containing CMV neutralizing epitopes would be used to generate monoclonal antibodies from which anti-idiotype antibodies could be generated. These antiidiotype antibodies may also be useful for treatment of CMV, as well as for an elucidation of the immunogenic regions of CMV antigens.

II.G. Diagnostic Oligonucleotide Probes and Kits

Using the disclosed CMV DNA as a basis, oligomers of approximately 8 nucleotides or more can be prepared, either by excision or synthetically, which hybridize with the CMV gB gene and are useful in the detection of unique viral sequences by hybridization. While 6-8 nucleotides may be a workable length, sequences of 10-12 nucleotides are preferred, and about 20 nucleotides appears optimal. Preferably, these sequences will derive from regions which lack heterogeneity. These probes can be prepared using routine methods, including automated oligonucleotide synthetic methods. For use as probes, complete complementarity is desirable, though it may be unnecessary as the length of the fragment is increased.

For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, is treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic acid from the sample may be subjected to gel electrophoresis or other size separation techniques; alternatively, the nucleic acid sample may be dot blotted without size separation. The probes are then labeled. Suitable labels, and methods for labeling probes are known in the art, and include, for example, radioactive labels incorporated by nick translation or kinasing, biotin, fluorescent probes, and chemiluminescent probes. The nucleic acids extracted from the sample are then treated with the labeled probe under hybridization conditions of suitable stringencies.

The probes can be made completely complementary to the CMV gB gene. Therefore, usually high stringency conditions are desirable in order to prevent false positives. However, conditions of high stringency should only be used if the probes are complementary to regions of the viral gene which lacks heterogeneity. The stringency of hybridization is determined by a number of factors during hybridization and during the washing procedure, including temperature, ionic strength, length of time, and concentration of formamide. These factors are outlined in, for example, Maniatis, T (1982).

Generally, it is expected that the CMV genome sequences will be present in serum of infected individuals at relatively low levels, i.e., at approximately 10² -10³ sequences per ml. This level may require that amplification techniques be used in hybridization assays. Such techniques are known in the art. For example, the Enzo Biochemical Corporation "Bio-Bridge" system uses terminal deoxynucleotide transferase to add unmodified 3'-poly-dT-tails to a DNA probe. The poly dT-tailed probe is hybridized to the target nucleotide sequence, and then to a biotin-modified poly-A. PCT application 84/03520 and EPA124221 describe a DNA hybridization assay in which: (1) analyte is annealed to a single-stranded DNA probe that is complementary to an enzyme-labeled oligonucleotide; and (2) the resulting tailed duplex is hybridized to an enzyme-labeled oligonucleotide. EPA 204510 describes a DNA hybridization assay in which analyte DNA is contacted with a probe that has a tail, such as a poly-dT tail, an amplifier strand that has a sequence that hybridizes to the tail of the probe, such as a poly-A sequence, and which is capable of binding a plurality of labeled strands. A particularly desirable technique may first involve amplification of the target CMV sequences in sera approximately 10,000 fold, i.e., to approximately 10⁶ sequences/ml. This may be accomplished, for example, by the technique of Saiki et al. (1986) Nature 324:163. The amplified sequence(s) may then be detected using a hybridization assay which is described in copending U.S. application Ser. No. 109,282, which was filed 15 Oct. 1987, is assigned to the herein assignee, and is hereby incorporated herein by reference. This hybridization assay, which should detect sequences at the level of 10⁶ /ml utilizes nucleic acid multimers which bind to single-stranded analyte nucleic acid, and which also bind to a multiplicity of single-stranded labeled oligonucleotides. A suitable solution phase sandwich assay which may be used with labeled polynucleotide probes, and the methods for the preparation of probes is described in copending European Patent Publication No. 225,807, published 16 Jun. 1987, which is assigned to the herein assignee, and which is hereby incorporated herein by reference.

II.H. Immunoassay and Diagnostic Kits

Both the recombinant polypeptides which react immunologically with serum containing CMV antibodies, and the antibodies raised against these recombinant polypeptides, are useful in immunoassays to detect the presence of CMV antibodies, or the presence of the virus, in biological samples, including for example, blood or serum samples. Design of the immunoassays is subject to a great deal of variation, and a variety of these are known in the art. For example, the immunoassay may utilize one viral antigen, for example a recombinant polypeptide derived from amino acids 461-680 of gp55; alternatively, the immunoassay may use a combination of viral antigens derived from the CMV genome. It may use, for example, a monoclonal antibody directed towards one viral antigen, a combination of monoclonal antibodies directed towards the one viral antigen, monoclonal antibodies directed towards different viral antigens, polyclonal antibodies directed towards the same viral antigen, or polyclonal antibodies directed towards different viral antigens. Protocols may be based, for example, upon competition, or direct reaction, or may be sandwich type asssays. Protocols may also, for example, use solid supports, or may be by immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays which amplify the signals from the probe are also known; examples of which are assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such as ELISA assays.

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by packaging the appropriate materials, including the recombinant polypeptides of the invention containing CMV epitopes or antibodies directed against epitopes in suitable containers, along with the remaining reagents and materials required for the conduct of the assay, as well as a suitable set of assay instructions.

The polynucleotide probes can also be packaged into diagnostic kits. Diagnostic kits include the probe DNA, which may be labeled; alternatively, the probe DNA may be unlabeled and the ingredients for labeling may be included in the kit. The kit may also contain other suitably packaged reagents and materials needed for the particular hybridization protocol, for example, standards, as well as instructions for conducting the test.

III. General Methods

The general techniques used in extracting the genome from a virus, preparing and probing a cDNA library, sequencing clones, constructing expression vectors, transforming cells, and the like are known in the art and laboratory manuals are available describing these techniques. However, as a general guide, the following sets forth some sources currently available for such procedures, and for materials useful in carrying them out.

III.A. Hosts and Expression Control Sequences

Both prokaryotic and eukaryotic host cells may be used for expression of desired coding sequences when appropriate control sequences which are compatible with the designated host are used. Among prokaryotic hosts, E. coli is most frequently used. Expression control sequences for prokaryotes include promoters, optionally containing operator portions, and ribosome binding sites. Transfer vectors compatible with prokaryotic hosts are commonly derived from, for example, pBR322, a plasmid containing operons conferring ampicillin and tetracycline resistance, and the various pUC vectors, which also contain sequences conferring antibiotic resistance markers. These markers may be used to obtain successful transformants by selection. Commonly used prokaryotic control sequences include the Beta-lactamase (penicillinase) and lactose promoter systems (Chang, et al. (1977) Nature 198:1056), the tryptophan (trp) promoter system (Goeddel, et al. (1980) Nuc Acids Res 8:4057) and the lambda-derived P_(L) promoter (Shimatake, et al. (1981) Nature 292:128) and N gene ribosome binding site and the hybrid tac promoter (De Boer, et al. (1983) Proc Natl Acad Sci USA 69:2110) derived from sequences of the trp and lac UV5 promoters. The foregoing systems are particularly compatible with E. coli; if desired, other prokaryotic hosts such as strains of Bacillus or Pseudomonas may be used, with corresponding control sequences.

Eukaryotic hosts include yeast and mammalian cells in culture systems. Saccharomyces cerevisiae and Saccharomyces carlsbergensis are the most commonly used yeast hosts, and are convenient fungal hosts. Yeast compatible vectors carry markers which permit selection of successful transformants by conferring prototrophy to auxotrophic mutants or resistance to heavy metals on wild-type strains. Yeast compatible vectors may employ the 2 micron origin of replication (Broach, et al. (1983) Meth Enz 101:307), the combination of CEN3 and ARS1 or other means for assuring replication, such as sequences which will result in incorporation of an appropriate fragment into the host cell genome. Control sequences for yeast vectors are known in the art and include promoters for the synthesis of glycolytic enzymes (Hess, et al. (1968) J Acv Enz Reg 7:149; Holland, et al. (1978) Biotechnology 17:4900), including the promoter for 3 phosphoglycerate kinase (Hitzeman (1980) J Biochem 255:2073). Terminators may also be included, such as those derived from the enolase gene (Holland (1981) J Biol Chem 256:1385). Particularly useful control systems are those which comprise the glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol dehydrogenase (ADH) regulatable promoter, terminators also derived from GAPDH, and if secretion is desired, leader sequence from yeast alpha factor. In addition, the transcriptional regulatory region and the transcriptional initiation region which are operably linked may be such that they are not naturally associated in the wild-type organism. These systems are described in detail in U.S. Ser. Nos. 468,589, 522,909, 760,197, and 868,639, filed 22 Feb. 1983, 12 Aug. 1983, 29 Jul. 1985, and 29 May 1986 respectively, all of which are assigned to the herein assignee, and are hereby incorporated herein by reference.

Mammalian cell lines available as hosts for expression are known in the art and include many immortalized cell lines available from the American Type Culture Collection (ATCC), including HeLa cells, Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells, and a number of other cell lines including myeloma lines. Suitable promoters for mammalian cells are also known in the art and include viral promoters such as that from Simian Virus 40 (SV40) (Fiers (1978)), Rous sarcoma virus (RSV), adenovirus (ADV), human, simian, and murine CMV, and bovine papilloma virus (BPV). Mammalian cells may also require terminator sequences. Vectors suitable for replication in mammalian cells may include viral replicons, or sequences which insure integration of the appropriate sequences encoding CMV epitopes into the host genome.

Expression may also be carried out with appropriate vectors, for example, baculovirus vectors, in transformed, cultured insect cells. Methods for insect cell cultures using, for example, Spodoptera frugiperda, are well known in the art and detailed procedures for their cultivation and use can be found in A Manual of Methods for Baculovirus vectors and Insect Cell Culture Procedures by M. D. Summers and G. E. Smith, Texas Agricultural Experimental Station Bulletin No. 1555, 2nd printing February 1988, and in EPA 127,839 published 12 Dec. 1984, to Smith, G. E. et al.

III.B. Transformations

Transformation may be by any known method for introducing polynucleotides into a host cell, including, for example packaging the polynucleotide in a virus and transducing a host cell with the virus, and by direct uptake of the polynucleotide. The transformation procedure used depends upon the host to be transformed. For example, transformation of the E. coli host cells with lambda-gt11 containing CMV sequences is discussed in the Example section, infra. Bacterial transformation by direct uptake generally employs treatment with calcium or rubidium chloride (Cohen (1972) Proc Natl Acad Sci USA 69:2110; Maniatis et al. (1982) Molecular Cloning; A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y.). Yeast transformation by direct uptake may be carried out using the method of Hinnen, et al. (1978) Proc Natl Acad Sci USA 75:1929. Mammalian transformations by direct uptake may be conducted using the calcium phosphate precipitation method of Graham and Van der Eb (1978) Virology 52:546, or the various known modifications thereof.

III.C. Vector Construction

Vector construction employs techniques which are known in the art. Site-specific DNA cleavage is performed by treating with suitable restriction enzymes under conditions which generally are specified by the manufacturer of these commercially available enzymes. In general, about 1 microgram of plasmid or DNA sequence is cleaved by 1 unit of enzyme in about 20 microliters buffer solution by incubation of 1-2 hr at 37° C. After incubation with the restriction enzyme, protein is removed by phenol/chloroform extraction and the DNA recovered by precipitation with ethanol. The cleaved fragments may be separated using polyacrylamide or agarose gel electrophoresis techniques, according to the general procedures found in Methods in Enzymology (1980) 65:499-560.

Sticky ended cleavage fragments may be blunt ended using E. coli DNA polymerase I (Klenow) in the presence of the appropriate deoxynucleotide triphosphates (dNTPs) present in the mixture. Treatment with S1 nuclease may also be used, resulting in the hydrolysis of any single stranded DNA portions.

Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase and ATP; sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments are used as part of a ligation mixture, the vector fragment is often treated with bacterial alkaline phosphatase (BAP) or calf intestinal alkaline phosphatase to remove the 5'-phosphate and thus prevent religation of the vector; alternatively, restriction enzyme digestion of unwanted fragments can be used to prevent ligation.

Ligation mixtures are transformed into suitable cloning hosts, such as E. coli, and successful transformants selected by, for example, antibiotic resistance, and screened for the correct construction.

III.D. Construction of Desired DNA Sequences

Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as described by Warner (1984). If desired the synthetic strands may be labeled with ³² P by treatment with polynucleotide kinase in the presence of ³² P-ATP, using standard conditions for the reaction.

DNA sequences, including those isolated from cDNA libraries, may be modified by known techniques, including, for example site directed mutagenesis, as described by Zoller (1982) Nuc Acids Res 10:6487. Briefly, the DNA to be modified is packaged into phage as a single stranded sequence, and converted to a double stranded DNA with DNA polymerase using, as a primer, a synthetic oligonucleotide complementary to the portion of the DNA to be modified, and having the desired modification included in its own sequence. The resulting double stranded DNA is transformed into a phage supporting host bacterium. Cultures of the transformed bacteria, which contain replications of each strand of the phage, are plated in agar to obtain plaques. Theoretically, 50% of the new plaques contain phage having the mutated sequence, and the remaining 50% have the original sequence. Replicates of the plaques are hybridized to labeled synthetic probe at temperatures and conditions which permit hybridization with the correct strand, but not with the unmodified sequence. The sequences which have been identified by hybridization are recovered and cloned.

III.E. Hybridization with Probe

DNA libraries may be probed using the procedure of Grunstein and Hogness (1975) Proc Natl Acad Sci USA 73:3961. Briefly, in this procedure, the DNA to be probed is immobilized on nitrocellulose filters, denatured, and prehybridized with a buffer containing 0-50% formamide, 0.75M NaCl, 75 mM Na citrate, 0.02% (wt/v) each of bovine serum albumin, polyvinyl pyrollidine, and Ficoll, 50 mM Na Phosphate (pH 6.5), 0.1% SDS, and 100 micrograms/ml carrier denatured DNA. The percentage of formamide in the buffer, as well as the time and temperature conditions of the prehybridization and subsequent hybridization steps depends on the stringency required. Oligomeric probes which require lower stringency conditions are generally used with low percentages of formamide, lower temperatures, and longer hybridization times. Probes containing more than 30 or 40 nucleotides such as those derived from cDNA or genomic sequences generally employ higher temperatures, e.g., about 40°-42° C., and a high percentage, e.g., 50%, formamide. Following prehybridization, 5'-³² P-labeled oligonucleotide probe is added to the buffer, and the filters are incubated in this mixture under hybridization conditions. After washing, the treated filters are subjected to autoradiography to show the location of the hybridized probe; DNA in corresponding locations on the original agar plates is used as the source of the desired DNA.

III.F. Verification of Construction and Sequencing

For routine vector constructions, ligation mixtures are transformed into E. coli strain HB101 or other suitable host, and successful transformants selected by antibiotic resistance or other markers. Plasmids from the transformants are then prepared according to the method of Clewell, et al. (1969) Proc Natl Acad Sci USA 62:1159, usually following chloramphenicol amplification (Clewell (1972) J Bacteriol 110:667). The DNA is isolated and analyzed, usually by restriction enzyme analysis and/or sequencing. Sequencing may be by the dideoxy method of Sanger, et al. (1977) Proc Natl Acad Sci USA 74:5463 as further described by Messing, et al. (1981) Nuc Acids Res 9:309, or by the method of Maxam, et al. (1980) Meth Enz 65:499. Problems with band compression, which are sometimes observed in GC rich regions, were overcome by use of T-deazaguanosine according to Barr, et al. (1986) Biotechniques 4:428.

III.G. Purification of gB Produced by CHO Cell Lines

A number of conventional protein purification techniques are available for use in the purification of gB. These procedures include, for example, chromatographic methods such as ion exchange, hydrophobic interaction, lentil lectin chromatography and gel permeation chromatography.

IV. Examples

Described below are examples of the present invention which are provided only for illustrative purposes, and not to limit the scope of the present invention.

Cells, Virus and Plasmids.

Human CMV (Towne) was obtained from E. S. Mocarski (Stanford University). Virus was grown in cultures of human foreskin fibroblast (HF) cells with Dulbecco's modified Eagle medium (DME) (Gibco Laboratories, Grand Island, N.Y.) according to the procedure of Spaete and Mocarski (1985a) J Virol 56:135-143, but supplemented with 10% fetal calf serum (FCS) (Hyclone, Logan, Utah).

Plasmid Constructions.

The HindIII D fragment of CMV (Towne), illustrated in FIG. 1, was cloned into plasmid pBR322 and designated pRL104a, which was a gift of R. L. La Femina and G. S. Hayward (Johns Hopkins University). Plasmid pXgB1, which encodes the entire gB gene, was derived from circularization of the 8.95 kb BamHI fragment of pRL104a. Thus, pXgB1 contains a 4.96 kb HindIII D/A to BamHI E/R fragment from the right end of HindIII D plus pBR322 sequences. Plasmid pXgB7 contains a truncated gB gene cloned into the expression vector pSV7d (Truett, M. A., et al. (1985) DNA 4:333-349) which contains the SV40 early promoter, origin and polyadenylation sequences, as well as sequences derived from pML. Plasmid pXgB7 was constructed by cloning gB as a 2.12 kb partial SacII/XhoI fragment into the SalI site of the pGEM-1 (Promega Biotec, Madison, Wis.) polylinker using the Klenow fragment (Boehringer Mannheim Biochemicals) to blunt the SacII site and to fill the unligated SalI site. This intermediate construct was designated pXgB6. The gB sequence was excised from the surrounding polylinker sequences of pXgB6 as a 2.13 kb XbaI/HindIII fragment and inserted into the XbaI site of pSV7d. The HindIII site was filled and ligated to the filled XbaI site of pSV7d to preserve the XbaI site at the 3'-end of gB. The resulting plasmid was designated pXgB7 and is shown in FIGS. 3A-3D.

Plasmid pXgB8 contains the same truncated gB sequences cloned into pON260, a CMV major immediate early (MIE) promoter driven beta-galactosidase (lacZ) expression vector. Plasmid pON260 is derived from pON249 (Geballe, A. P., et al. (1986) Cell 46:865-872) by removal of a BalI to SalI fragment upstream from the CMV enhancer. The 2.13 kb XbaI fragment encoding gB was excised from pXgB7 and transferred to pON260, which had been cut with XbaI and PvuII to remove all but 15 C-terminal amino acids of the lacZ coding sequences. These lacZ sequences are not expressed in pXgB8 due to the presence of an upstream stop codon. Another CMV MIE promoter based expression plasmid, PMIE was constructed to eliminate the lacZ coding sequences resident in pON260. CMV MIE promoter sequences from the first BalI site upstream of the enhancer to the SacI site 8 bp downstream of the TATA box were removed from pON260 as a 0.67 kb SalI/XbaI fragment and cloned into plasmid pSV7b, a construct resembling pSV7d, which had been digested with SalI and BglII to remove the SV40 enhancer, origin and promoter leaving the SV40 polyadenylation signals intact.

The full length gB gene was cloned into pMT11/EagI (a pBR322-derived plasmid vector described by Spaete et al., 1985a) as a 3.12 kb EagI fragment in both orientations and the plasmids were designated pXgB9 and pXgB11. The gB sequences were excised from plasmid pXgB11 using the EcoRI and BamHI sites in the polylinker and cloned into PMIE polylinker sequences at EcoRI and XbaI. The resulting plasmid was designated pXgB12 and is illustrated in FIGS. 3A-3D.

The EcoRI/BamHI fragment used to generate pXgB12 was also cloned into the polylinker sequences of pSV7d cut with EcoRI and BamHI. This SV40 expression plasmid was designated pXgB13 and is also illustrated in FIGS. 3A-3D.

The gB gene cloned in pXgB6 was deleted by removing 1106 bp of N-terminal gB coding sequences between the AatII site and the NdeI site. The ends were blunted using the Klenow fragment and religated to create a SnaB1 site and preserve the reading frame. This plasmid was designated pXgB19. A 1036 bp XbaI/HindIII fragment encoding the deleted gB gene was excised from pXgB19 and cloned into the unique SalI site of pMCMVAdhfr using Klenow to fill the sites prior to ligation of the blunt ends. The expression vector, pMCMVAdhfr, is colinear with pCMVAdhfr, described below, except that the human CMV promoter has been substituted by the murine CMV (MCMV) immediate early promoter cloned as a HpaI/PstI fragment.

To develop plasmids expressing uncleaved gB, the endoproteolytic cleavage site of gB is mutagenized in vitro using M13 cloned templates and the four mutagenesis oligonucleotides described below: ##STR1## A search of the gB and M13 sequences has revealed no potential binding sites for these 36 mers other than the cleavage site. A sequencing primer,

5'3 CGC CCG GTT GAT GTA ACC GCG 3',

which lies 93 bp from the cleavage site, is also generated. The template strand is primed with each of the mutagenesis oligonucleotides followed by elongation. The resulting dsDNA is used to transform a suitable M13 host strain and the mutagenized DNAs isolated by sequencing to generate replicative form (RF) DNA. RF DNA is digested with EcoRI and ApaLI and these fragments are exchanged for wild type segments in the gB expression plasmid pXgB23 (see below), or in a similar gB construct where transcription is promoted by the murine CMV immediate early promoter.

An expression vector, pCMVAdhfr, employing the human CMV major immediate early (MIE) promoter and also containing the mouse dhfr cDNA linked to the adenovirus major late promoter (Stuve, et al. (1987) J Virol 61:326-335), was used to clone a 2196 bp EagI/XhoI gB fragment as a BamHI/XhoI fragment taken from pXgB9. This gB construct, pXgB23, has an insert identical at the 5' end to the gB insert of pXgB12 and pXgB13 in that it contains 153 bp of 5'-untranslated gB leader sequence. The construct is identical at the 3' end to the gB insert of pXgB8 in that it is truncated at the C-terminus by the deletion of amino acids 681-907 removing the transmembrane domain and C-terminal domains.

All bacterial cloning was done in Escherichia coli HB101 or DH5alpha according to the procedure of Spaete et al., (1985b) J Virol 54:817-824. Procedures used for preparation of plasmid DNA and restriction enzyme analyses are also described in Spaete et al., 1985b, supra. All plasmids used in transfections were banded twice in cesium chloride gradients. Restriction enzymes and T4 DNA ligase were purchased from New England Biolabs or Bethesda Research Laboratories (BRL) and were used according to the manufacturer's specifications.

Nucleotide Sequence Determination and Analysis.

DNA fragments were subcloned into M13 phage vectors mp18 and mp19 (Pharmacia, Piscataway, N.J.) as well as polylinker derivatives of these vectors, plasmids rt1 and rt2. Plasmid rt1 contains a polylinker with the following restriction enzyme sites in the order given: HindIII, XbaI, EcoRV, SalI, SphI, BamHI, NcoI, PstI, KpnI, SstI, EcoRI. In rt2 the site order in the polylinker is reversed. Single-stranded viral DNA was generated as template for sequencing by the dideoxy nucleotide chain-termination method of Sanger, F., et al. (1977) Proc Natl Acad Sci USA 74:5463-5467. The dGTP base analog, 7-deaza dGTP (American Bionetics, Hayward, Calif.; Boehringer Mannheim Biochemicals), was used to resolve compressed regions (regions with high G/C content). The DNA was sequenced in its entirety on both strands and all junctions were bridged using oligonucleotide primers synthesized on an Applied Biosystems 380A synthesizer.

DNA Transfections.

COS-7 cells (Gluzman, Y. (1981) Cell 23:175-182) were transfected as described by Spaete et al., 1985. Briefly, 10 to 35 ug of plasmid DNA was mixed with 1.4 ml DME-50 mM Tris hydrochloride (pH 7.4) containing 400-600 ug of DEAE dextran per ml and added to 6 cm dishes containing cells at 50-80% confluency. Cells were washed with DME-50 mM Tris hydrochloride (pH 7.4) at 4-6 h posttransfection and incubated in DME-10% FCS at 37° C. After 24 hr, a portion of the transfected cells were subcultured into 4-chamber plastic slide wells (Lab-Tek) for immunofluorescence studies. Other dishes of cells were allowed to grow to confluence and conditioned media was harvested at 72 hr posttransfection.

A DHFR-deficient CHO cell line (Urlaub and Chasin (1980) Proc Natl Acad Sci USA 77:4216-4220) was cotransfected as described in Stuve, et al., supra, using plasmids pXgB8 and Ad-dhfr. Selective medium, consisting of DME with 10% dialyzed fetal calf serum and supplemented as described in Pachl, et al. (1987) J Virol 61:315-325), was applied to the transfected cells at 2 days post-infection. Several dhfr positive clones were analyzed for gB expression by immunofluorescence and ELISA of conditioned media. Stable cell lines secreting gB were examined and the highest producing clone expressed gB at a level similar to that detected in COS cells.

It is also possible to increase gB expression on these stable cell lines using methotrexate (MTX) amplification as taught in the art.

Immunofluorescence.

COS-7 cells producing gB were identified by indirect immunofluorescence using the murine monoclonal 15D8 (Rasmussen et al., 1985) as the primary antibody and FITC-conjugated goat anti-mouse IgG (Tago, Inc., Burlingame, Calif.; Chemicon, El Segundo, Calif.) as the secondary antibody. The FITC conjugates were used at dilutions of 1:50 (Tago) and 1:80 (Chemicon). Slides were observed using a Leitz Dialux 20 EB fluorescent microscope.

Expression of gB was detected by 15D8 in COS cells transfected with all four gB expression plasmids, indicating that the p130 and p55 glycoproteins are encoded by the gB gene. Transfected cells which received truncated versions of gB exhibited a diffuse cytoplasmic immunofluorescent staining pattern. In contrast, cells transfected with the full length gB gene showed a punctate cytoplasmic staining pattern, which suggests a membrane association due to the presence of the transmembrane domain in these constructs.

ELISA Assay for gB.

Microtiter plates (Immulon 1, Dynatech Laboratories, Inc.) were coated with murine monoclonal 15D8 gamma globulin (0.1 ug/well) diluted in 50 mM sodium borate (pH 9.1) and incubated for 2 hr at 37° C. The plates were washed, incubated for 1 hr with phosphatebuffered saline (PBS; 0.15M NaCl, 2.7 mM KCl 15.3 mM Na₂ HPO₄, 1.5 mM KH₂ PO₄) plus 2.0% BSA and then incubated overnight at 37° C. with conditioned media from transfected COS cells or a mixture of CMV glycoproteins (described below) which included gB. Washed plates were then incubated for 1 hour at 37° C. with a human anti-CMV serum (Whitaker M. A. Bioproducts, Inc.), followed by incubation for 1 hour at 37° C. with a 1:500 dilution of peroxidase-conjugated goat anti-human IgG (Cooper Biomedical, Inc.). The plates were developed with 0.83 mg/ml O-phenylenediamine in 0.1M citrate-phosphate buffer (pH 5.0) plus 0.015% H₂ O₂, the reaction stopped with 4M H₂ SO₄, and the absorbance read at 490 nm. After each incubation with antigen or antibodies, the plates were washed 5 times with PBS plus 0.05% Tween 20 and 0.1% BSA and 5 times with PBS alone for the final wash. All dilutions of antigens and antibodies were made in PBS plus 0.05% Tween 20 and 0.5% BSA.

The CMV glycoprotein mixture used as a standard for the ELISA was prepared by infecting approximately 4×10⁸ HF cells with CMV (Towne) at a MOI of 0.2. Seven days after infection, the cells were lysed in 40 ml of lysis buffer (LB) containing 150 mM NaCl, 20 mM Tris pH 7.5, 1% NP40, 0.5% DOC, 1 mM PMSF, 1 ug/ml pepstatin and 17 ug/ml aprotinin. The lysate was passed over a column of lentil lectin Sepharose-4B (Sigma Chemical Co., St. Louis, Mo.) equilibrated in LB. The column was washed in LB plus 0.5M NaCl and bound glycoproteins eluted in LB plus 0.5M NaCl and 1.0M alpha-methylmannoside.

Conditioned media was collected from the transfected cells containing the truncated gB gene and analyzed for the presence of secreted gB protein by the indirect ELISA specific for CMV gB. As expected, gB protein was detected in media taken from cells expressing the truncated version (pXgB7 and pXgB8) of the protein as provided by the data in Table 1 below.

                  TABLE 1                                                          ______________________________________                                         Expression of Truncated gB in COS-7 Cells.sup.a                                           Relative Absorbance                                                                         Fold                                                   Plasmid    Values       Enhancement                                            ______________________________________                                         pSV7d      0.03         --                                                     pXgB7      0.17          5.7                                                   pXgB8      1.04         34.7                                                   ______________________________________                                          .sup.a COS7 cells were transfected with CMV gB expression plasmids and at      72 h posttransfection conditioned medium was collected and the presence o      gB was determined by ELISA using the mouse monoclonal 15D8. The absorbanc      values were taken from the average of two determinations of equivalent         dilutions which were within the linear portion of a standard curve. The        standard curve was derived using a mixture of lentil lectin purified CMV       glycoproteins, which included gB, isolated from infected cells.          

Proteolytic Cleavage Inhibition Studies.

Both cleaved (93 kDa and 31 kDa) and uncleaved (110 kDa) forms of gB are secreted from a CHO cell line (line 67.77) transformed with plasmid pXgB8. Cell line 67.77, expressing a truncated secreted form of gB and negative cell line 5--5, were radiolabeled with ³⁵ S-methionine for 2 h in DME medium or REM (reinforced Eagle's medium) lacking calcium with or without the addition of the calcium-specific ionophore A23187 at increasing concentrations (0.062 uM to 0.25 uM). The cells were chased with unlabeled media for 4 h, lysates and media were immunoprecipitated with MAb 15D8, subjected to 12% SDS-PAGE and autoradiographed. The dose of A23187 which most completely inhibits gB cleavage is 0.25 uM. The results clearly indicate that the 93 kDa and 31 kDa gB cleavage fragments were chased into the 110 kDa precursor with increasing drug concentration, however cleavage of the precursor was not completely inhibited.

These results indicate that (i) the 110 kDa precursor observed by radioimmunoprecipitation and Western blot analysis does represent inefficiently cleaved precursor and not an unreduced complex of cleavage products; (ii) the uncleaved precursor is recognized by the conformation dependent virus neutralizing MAb 15D8 demonstrating that the native structure of this important epitope is maintained in the 110 kDa molecule; and (iii) the ability to chase the 93 kDa and 31 kDa cleavage products into the 110 kDa precursor with increasing concentrations of drug establishes the precursor/product relationship of these fragments and demonstrates that the 93 kDa fragment represents the N-terminus of gB. The identity of the 31 kD molecules as the C-terminal fragment is established by amino acid sequence analysis and is described below. Since an uncleaved gB molecule will be simpler to purify from CHO conditioned media as compared to the partially cleaved complex currently being purified from CHO cell line 67.77, a proteolytic cleavage site gB mutant facilitates the isolation and purification of this important molecule.

N-terminal Amino Acid Sequence of gp55 and Determination of the gp55 Cleavage Site in gB.

Glycoprotein B was purified by passing clarified cell lysate from CMV-infected human foreskin fibroblasts over an immunoaffinity column prepared with monoclonal antibody 15D8. The proteins bound to the column were eluted with ammonium thiocyanate and concentrated by precipitation with trichloroacetic acid. The proteins were then separated on a 10% preparative SDS-polyacrylamide gel, followed by electrophoretic transfer of the proteins onto an Immobilon membrane (Millipore). The membrane was stained with Coomassie blue to locate the transferred proteins. The gp55 band was excised and used for sequence determination by Edman degradation using a gas phase protein sequencer (Applied Biosystems, Foster City, Calif.). Phenylthiohydantoin (PTH) residues were identified by C18 reverse-phase high-pressure liquid chromatography.

The resulting sequence analysis of the amino acids at the N-terminus of gp55 is shown in Table 2 and localizes the cleavage site to the peptide bond following the dibasic residues Lys₄₅₉ Arg₄₆₀. The cleavage site is shown on FIGS. 2A-2F as a bold arrow directed between Arg₄₆₀ and Ser₄₆₁.

                  TABLE 2                                                          ______________________________________                                         Sequence analysis of amino acids                                               at the N-terminus of gp55                                                             Predicted      Observed Yield                                           Cycle  Residue.sup.a  Residue  (pmol).sup.b                                    ______________________________________                                         1      S              S        68                                              2      T              T        54                                              3      D              D        63                                              4      G              G        75                                              5      N              N        50                                              6      N              N        .sup. 20.sup.c                                  7      A              A        68                                              8      T              T        29                                              9      H              H        11                                              10     L              L        85                                              ______________________________________                                          .sup.a The amino acid sequences are based on nucleotide sequences from th      Towne strain of CMV (see FIG. 2).                                              .sup.b Picomoles of phenylthiohydantoin (PTH) amino acid uncorrected for       background or lag.                                                             .sup.c The low yield of PTHasparagine may indicate the presence of             glycosylation at this site.                                              

Deletion Mapping of the gB Neutralizing Epitope Recognized by Monoclonal 15D8.

The truncated version of the gB gene encoded by pXgB7 was used to generate additional C-terminal deletions in the gp55 region of gB. A deletion plasmid, pXgB16, which eliminated 34 amino acids was generated by removing a 102 bp SalI/XhoI fragment encompassing amino acids 647-680. This DNA fragment was 5' proximal to the XhoI site used in the construction of pXgB7. A second deletion plasmid, pXgB17, deleted a 186 bp BglII/XhoI fragment encoding 62 amino acids which encompassed amino acids 619-680. Thus, pXgB16 and gXgB17 express processed/truncated proteins of 186 amino acids (residues Ser₄₆₁ to Asp₆₄₆) and 158 amino acids (residues Ser₄₆₁ to Ile₆₁₈), respectively.

A third deletion plasmid, pXgB18, eliminated 369 amino acids and was generated by removing a 1106 bp fragment encompassing amino acids 43-411 from pXgB8. The gB insert is identical to that described for pXgB22.

The ability to detect expression of these truncated constructs was analyzed after transient expression in COS-7 cells by ELISA using monoclonal 15D8 as a probe as described above. As shown in Table 3, expression was detected in media conditioned by cells transfected with pXgB7 as expected from earlier results. Expression was also detected in media conditioned by the cells transfected with pXgB16 expressing the 186 amino acid truncated gp55 fragment and with pXgB18 expressing the 287 amino acid gB fragment not counting the cleaved signal peptide. Expression was not detected in media from cells receiving the control plasmid, pSV7d, or from cells receiving pXgB17 which should express the 158 amino acid gp55 fragment. However, expression from pXgB18 was detected by immunofluorescence of transfected COS cells, indicating that 15D8 could recognize this N-terminal truncated construct. These results indicate that the 15D8 epitope maps within the 186 amino acid gp55 fragment encoded by pXgB16 and by pxgB18, and further that deletion of an additional 28 amino acids from the C-terminus of this fragment must remove a portion of the epitope essential for reactivity with 15D8.

                  TABLE 3                                                          ______________________________________                                         Mapping of the gB Neutralizing Epitope.sup.a                                   Plasmid   Relative Absorbance Values                                                                      Expression                                          ______________________________________                                         pSV7d     0.00             -                                                   pXgB7     0.24             +                                                   pXgB16    0.08             +                                                   pXgB17    0.00             -                                                   pXgB18    0.00             .sup. +.sup.b                                       ______________________________________                                          .sup.a See Table 1, footnote a.                                                .sup.b Measured by immunofluorescence.                                   

Additional experiments were run using the truncated gB products to determine whether a panel of monoclonal antibodies produced against a family of CMV glycoproteins, previously designated gA1-gA6 (U.S. Pat. No. 4,689,225), reacted with the proteins expressed by the plasmids described above.

Transient expression experiments with COS cells transfected with plasmid encoding sequences as described for pXgB17, which lacks 289 carboxyl-terminal amino acids of gB, showed immunofluorescent reactivity of this gB truncated derivative with 10 independently derived monoclonal antibodies (see FIG. 4 and Banks et al. (1989) J Gen Virol (In press)). Eight of the reactive antibodies neutralized virus in the presence of complement, whereas one did not require complement for neutralization.

It was also determined that 12 additional antibodies reacted with a stable cell line producing a gB derivative (pXgB8) lacking 227 carboxyl-terminal amino acids of gB.

These results establish both the identity and location of neutralizing domains of the gB molecule and confirm the virus neutralizing characteristics of the gB truncated proteins described herein.

Modifications of the above-described modes for carrying out the invention that are obvious to those of skill in the technical fields related to the invention are intended to be within the scope of the following claims. 

We claim:
 1. A recombinant polynucleotide encoding a C-terminally truncated polypeptide of a cytomegalovirus (CMV) glycoprotein B (gB) polypeptide which contains a neutralizing domain of gp55 that is immunologically reactive with a CMV neutralizing antibody, said polypeptide comprising a modified gp55 endoproteolytic cleavage site such that cleavage of the gB protein at said endoproteolytic cleavage site is inhibited.
 2. The recombinant polynucleotide of claim 1 wherein the modification of the endoproteolytic cleavage site comprises a substitution of one or more amino acids at or near the proteolytic cleavage site.
 3. The recombinant polynucleotide of claim 2 encoding a gB polypeptide wherein one or more of the amino acids arginine, lysine and arginine, occurring at positions -1, -2 and -4, respectively, relative to the point of cleavage, are substituted with threonine, glutamine or threonine.
 4. The recombinant polynucleotide of claim 1 encoding the truncated polypeptide which lacks the transmembrane and cytoplasmic domains of CMV gB.
 5. The recombinant polynucleotide of claim 4 encoding the truncated polypeptide of amino acid residues 1 through 680 of CMV gB.
 6. The recombinant polynucleotide of claim 4 encoding the truncated polypeptide of amino acid residues 1 through 646 of CMV gB.
 7. A recombinant polynucleotide encoding a cytomegalovirus (CMV) glycoprotein B (gB) which comprises a modified endoproteolytic cleavage site such that cleavage of the gB protein between amino acids 460 and 461 is inhibited.
 8. The recombinant polynucleotide of claim 7 encoding a gB polypeptide comprising a modified endoproteolytic cleave site wherein the modification comprises a substitution of one or more amino acids at or near the proteolytic cleavage site.
 9. The recombinant polynucleotide of claim 8 encoding a gB polypeptide wherein one or more of the amino acids arginine, lysine and arginine, occurring at positions -1, -2 and -4, respectively, relative to the point of cleavage, are substituted with threonine, glutamine or threonine.
 10. A recombinant polynucleotide encoding a C-terminally truncated polypeptide of a cytomegalovirus (CMV) glycoprotein B (gB) polypeptide which contains a neutralizing domain of gp55 that is immunologically reactive with a CMV neutralizing antibody, wherein said truncated polypeptide comprises a modified endoprotelytic cleavage site such that cleavage of the polypeptide between amino acids 460 and 461 is inhibited.
 11. The recombinant polynucleotide of claim 10 encoding the truncated polypeptide which lacks the transmembrane and cytoplasmic domains of CMV gB.
 12. The recombinant polynucleotide of claim 11 encoding the truncated polypeptide of amino acid residues 1 through 680 of CMV gB.
 13. The recombinant polynucleotide of claim 11 encoding the truncated polypeptide of amino acid residues 1 through 646 of CMV gB.
 14. A recombinant expression vector comprising the polynucleotide of claim
 1. 15. A recombinant expression vector comprising the polynucleotide of claim
 7. 16. A recombinant expression vector comprising the polynucleotide of claim
 10. 17. A host cell transformed with the vector of claim
 14. 18. A host cell transformed with the vector of claim
 15. 19. A host cell transformed with the vector of claim
 16. 